- 32 - 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: S tudiengesellschaf t Kohle mbH 

(B) STREET: Kaiser-Wilhelm- Platz 1 

(C) CITY: Muelheim an der Ruhr 

(E) COUNTRY: Germany 

(F) POSTAL CODE (ZIP) : 45470 

(ii) TITLE OF INVENTION: A Process for the Preperation and 
Identification of Novel Hydrolases Having Improved 
Properties 

(iii) NUMBER OF SEQUENCES: 21 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentin Release #1,0, Version #1.3 0 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

GCGCAATTAA CCCTCACTAA AGGGAACAAA 3 0 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

GCGTAATACG ACTCACTATA GGGCGAA - 2 7 



(2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 85 . .1017 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 163 . . 1017 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 6 0 

CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TAT CTG CTC CCC CTC 111 

Met Lys Lys Lys Tyr Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 15 9 

Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 

GCC AGC ACC TAG ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 2 07 

Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 
15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTC GGG GTC GAC TAC TGG TTC GGC ATT 2 55 

Met Leu Gly Phe Asp Asn He Leu Gly Val Asp Tyr Trp Phe Gly He 
20 25 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GTC 3 03 

Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Val 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 3 51 

Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 3 99 

Val Glu Glu He Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu He 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 44 7 

Gly His Ser His Gly Gly Pro Thr He Arg Tyr Val Ala Ala Val Arg 
80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ATC AGC GTC GGC GCC CCG CAC AAG GGT 4 95 

Pro Asp Leu He Ala Ser Ala He Ser Val Gly Ala Pro His Lys Gly 
100 105 110 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 54 3 

Ser Asp Thr Ala Asp Phe Leu Arg Gin He Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 5 91 

Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu He Ser 
130 135 140 
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TTC CTT TCC AGC GGC GGC ACC GGT ACG CAG AAT TCA CTG GGC TCG CTG 63 9 

Phe Leu Ser Ser Gly Gly Thr Gly Thr Gin Asn Ser Leu Gly Ser Leu 

145 150 155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAG CCG 68 7 

Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 

160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCC TAC AAG GTC AAC 73 5 

Gin Gly lie Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 

180 185 190 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 7 83 

Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 

195 200 205 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 831 

Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 
210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 8 79 

Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 

225 230 235 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 927 

Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 

240 245 250 255 

AAC CAG GTC TTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 97 5 

Asn Gin Val Phe Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 

260 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 

Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 

275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 104 9 



(2) INFORMATION FOR SEQ ID NO : 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Lys Lys Lys Tyr Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 

-26 -25 -20 -15 

Ser Leu Ala Ala Ser Pro Leu lie Gin Ala Ser Thr Tyr Thr Gin Thr 

-10 -5 15 

Lys Tyr Pro lie Val Leu Ala His Gly Met Leu Gly Phe Asp Asn lie 
10 15 20 



Leu Gly Val Asp Tyr Trp Phe Gly lie Pro Ser Ala Leu Arg Arg Asp 
25 30 35 
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Gly Ala Gin Val Tyr Val Thr Glu Val Ser Gin Leu Asp Thr Ser Glu 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu lie Val Ala Leu 
55 60 65 70 

Ser Gly Gin Pro Lys Val Asn Leu lie Gly His Ser His Gly Gly Pro 
75 80 85 

Thr lie Arg Tyr Val Ala Ala Val Arg Pro Asp Leu lie Ala Ser Ala 
90 95 100 

lie Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr Ala Asp Phe Leu 
105 110 115 

Arg Gin lie Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 

Val Asn Ser Leu Gly Ala Leu lie Ser Phe Leu Ser Ser Gly Gly Thr 
135 140 145 150 

Gly Thr Gin Asn Ser Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 160 165 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly He Pro Thr Ser Ala 
170 175 ISO 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Trp 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 210 

Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220 225 230 

Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val He Arg Asp Asn 
235 240 245 

Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Phe Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 85 . . 1017 
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(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 163 . . 1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 6 0 

CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 111 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 15 9 

Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 

GCC AGC ACC TAC ACC CAG ACC AAA TAG CCC ATC GTG CTG GCC CAC GGC 2 07 

Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 

15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTC GGG GTC GAC TAC TGG TTC GGC ATT 2 55 

Met Leu Gly Phe Asp Asn lie Leu Gly Val Asp Tyr Trp Phe Gly lie 

20 25 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GTC 3 03 

Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Val 

35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 351 

Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 3 99 

Val Glu Glu lie Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu lie 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 44 7 

Gly His Ser His Gly Gly Pro Thr lie Arg Tyr Val Ala Ala Val Arg 

80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AAG GGT 49 5 

Pro Asp Leu lie Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 

100 105 110 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 54 3 

Ser Asp Thr Ala Asp Phe Leu Arg Gin lie Pro Pro Gly Ser Ala Gly 

115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 5 91 

Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu lie Ser 
130 135 140 

TTC CTT TCC AGC GGC GGC ACC GGT ACG CAG AAT TCA CTG GGC TCG CTG 63 9 

Phe Leu Ser Ser Gly Gly Thr Gly Thr Gin Asn Ser Leu Gly Ser Leu 
145 150 155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAC CCG 6 87 

Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 

160 165 170 175 
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CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCC TAG AAG GTC AAC 73 5 

Gin Gly lie Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 185 190 

GGC GTG AGC TAT TAG TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 7 83 

Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 200 205 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 831 
Le\i Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 
210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 87 9 

Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 
225 230 235 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 92 7 

Gly Met Val He Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 
240 245 250 255 

AAC CAG GTC TTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 97 5 

Asn Gin Val Phe Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 
275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 104 9 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala He Gly Leu Ala 
-26 -25 -20 -15 

Ser Leu Ala Ala Ser Pro Leu He Gin Ala Ser Thr Tyr Thr Gin Thr 
-10 -5 15 

Lys Tyr Pro He Val Leu Ala His Gly Met Leu Gly Phe Asp Asn He 
10 15 20 

Leu Gly Val Asp Tyr Trp Phe Gly He Pro Ser Ala Leu Arg Arg Asp 
25 30 35 

Gly Ala Gin Val Tyr Val Thr Glu Val Ser Gin Leu Asp Thr Ser Glu 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu He Val Ala Leu 
55 60 65 70 



Ser Gly Gin Pro Lys Val Asn Leu He Gly His Ser His Gly Gly Pro 
75 80 85 
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Thr lie Arg Tyr Val Ala Ala Val Arg Pro Asp Leu He Ala Ser Ala 
90 95 100 

Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr Ala Asp Phe Leu 
105 110 115 

Arg Gin He Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 

Val Asn Ser Leu Gly Ala Leu He Ser Phe Leu Ser Ser Gly Gly Thr 
135 140 145 150 

Gly Thr Gin Asn Ser Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 160 165 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly He Pro Thr Ser Ala 
170 175 180 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Trp 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 210 

Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220 225 230 

Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val He Arg Asp Asn 
235 240 245 

Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Phe Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 85, .1017 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 163 . . 1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 
GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 6 0 
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CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 111 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 15 9 

Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 

GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 2 07 

Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 
15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGG TTC GGC ATT 25 5 

Met Leu Gly Phe Asp Asn lie Leu Gly Val Asp Tyr Trp Phe Gly lie 
20 25 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GTC 3 03 

Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Val 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 3 51 

Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 3 99 

Val Glu Glu lie Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu lie 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 44 7 

Gly His Ser His Gly Gly Pro Thr lie Arg Tyr Val Ala Ala Val Arg 
80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AAG GGT 4 95 

Pro Asp Leu lie Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 
100 105 110 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 54 3 

Ser Asp Thr Ala Asp Phe Leu Arg Gin lie Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 5 91 

Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu lie Ser 
130 135 140 

TTC CTT TCC AGC GGC GGC ACC GGT ACG CAG AAT TTA CTG GGC TCG CTG 63 9 

Phe Leu Ser Ser Gly Gly Thr Gly Thr Gin Asn Leu Leu Gly Ser Leu 
145 150 155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAC CCG 687 
Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 
160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCC TAC AAG GTC AAC 73 5 

Gin Gly lie Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 185 190 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 7 83 

Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 200 205 
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CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 831 

Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 
210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 8 79 

Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 

225 230 235 

GGC ATG GTG ATC CGC GAC AAC TAG CGG ATG AAC CAC CTG GAC GAG GTG 92 7 

Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 

240 245 250 255 

AAC CAG GTC TTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 97 5 

Asn Gin Val Phe Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 

260 265 270 

GTC TAG CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 

Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 

275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 104 9 



(2) INFORMATION FOR SEQ ID NO : 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 
-26 -25 -20 -15 

Ser Leu Ala Ala Ser Pro Leu lie Gin Ala Ser Thr Tyr Thr Gin Thr 
-10 -5 15 

Lys Tyr Pro He Val Leu Ala His Gly Met Leu Gly Phe Asp Asn He 
10 15 20 

Leu Gly Val Asp Tyr Trp Phe Gly He Pro Ser Ala Leu Arg Arg Asp 
25 30 35 

Gly Ala Gin Val Tyr Val Thr Glu Val Ser Gin Leu Asp Thr Ser Glu 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu He Val Ala Leu 
55 60 65 70 

Ser Gly Gin Pro Lys Val Asn Leu He Gly His Ser His Gly Gly Pro 
75 80 85 

Thr He Arg Tyr Val Ala Ala Val Arg Pro Asp Leu He Ala Ser Ala 
90 95 100 

Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr Ala Asp Phe Leu 
105 110 115 



Arg Gin He Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 
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Val Asn Ser Leu Gly Ala Leu lie Ser Phe Leu Ser Ser Gly Gly Thr 
135 140 145 150 

Gly Thr Gin Asn Leu Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 160 165 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly lie Pro Thr Ser Ala 
170 175 180 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Trp 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 210 

Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220 225 230 

Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val lie Arg Asp Asn 
235 240 245 

Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Phe Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 

(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1047 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 84 . . 1016 

(ix) FEATURE: 

(A) NAME/KEY: mat_pept ide 

(B) LOCATION: 162 - . 1016 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GGATCCCCGG TTCTCCCGGA AGGATTCGGG CGATGGCTGG CAGGACGCGC CCCTCGGCCC 6 0 

CATCAACCTG AGATGAGAAC AAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 110 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 15 8 

Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 
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GCC AGC ACC TAG ACC CAG ACC AAA TAG CCC ATC GTG CTG GCC CAC GGC 2 06 

Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 
15 10 IS 

ATG CTC GGC TTC GAG AAC ATC CTG GGG GTC GAG TAG TGG TTC GGC ATT 2 54 

Met Leu Gly Phe Asp Asn He Leu Gly Val Asp Tyr Trp Phe Gly He 
20 25 30 

GCC AGC GCC TTG GGC CGT GAC GGT GCC CAG GTC TAG GTC ACC GAA GTC 3 02 

Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Val 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 3 50 

Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 3 98 

Val Glu Glu He Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu He 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAG GTC GCC GCC GTA CGT 446 

Gly His Ser His Gly Gly Pro Thr He Arg Tyr Val Ala Ala Val Arg 
80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AAG GGT 4 94 

Pro Asp Leu He Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 

100 105 110 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 542 

Ser Asp Thr Ala Asp Phe Leu Arg Gin He Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GGG CTG ATC AGC 590 

Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu He Ser 
130 135 140 

TTC CTT TCC AGC GGC AGC ACC GGT ACG CAG AAT TCA CTG GGC TCG CTG 63 8 

Phe Leu Ser Ser Gly Ser Thr Gly Thr Gin Asn Ser Leu Gly Ser Leu 
145 150 155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAG CCG 6 86 

Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 
160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGG GGC GAA GGC GCC TAG AAG GTC AAC 7 34 

Gin Gly He Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 

180 185 190 

GGC GTG AGC TAT TAG TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 7 82 

Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 200 205 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 83 0 

Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 
210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 878 

Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 
225 230 235 
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GGC ATG GTG ATC CGC GAG AAC TAG CGG ATG AAC CAC CTG GAG GAG GTG 92 6 

Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 
240 245 250 255 

AAC CAG GTC TTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 974 
Asn Gin Val Phe Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 265 270 

GTC TAG CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1016 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 
275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC C 104 7 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 



-26 -25 



-20 



-15 



Ser Leu Ala Ala Ser Pro Leu lie Gin Ala Ser Thr 

-10 -5 1 



Tyr Thr Gin Thr 
5 



Lys Tyr Pro lie Val Leu Ala His Gly Met Leu Gly 
10 15 



Phe Asp Asn lie 
20 



Leu Gly Val Asp Tyr Trp Phe Gly lie Pro Ser Ala 
25 30 



Leu Arg Arg Asp 
35 



Gly Ala Gin Val Tyr Val Thr Glu Val Ser Gin Leu Asp Thr Ser Glu 



40 



45 



50 



Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu 



55 



60 



65 



He Val Ala Leu 
70 



Ser Gly Gin Pro Lys Val Asn Leu He Gly His Ser 
75 80 



His Gly Gly Pro 
85 



Thr He Arg Tyr Val Ala Ala Val Arg Pro Asp Leu 
90 95 



He Ala Ser Ala 
100 



Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr 
105 110 



Ala Asp Phe Leu 
115 



Arg Gin He Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 



120 



125 



130 



Val Asn Ser Leu Gly Ala Leu He Ser Phe Leu Ser 



135 



140 



145 



Ser Gly Ser Thr 
150 



Gly Thr Gin Asn Ser Leu Gly Ser Leu Glu Ser Leu 
155 160 



Asn Ser Glu Gly 
165 



- 44 - 



Ala Ala Arg Phe 
170 

Cys Gly Glu Gly 
185 

Ser Gly Ser Ser 
200 

Leu Gly Ala Ser 
215 

Leu Val Gly Thr 



Tyr Arg Met Asn 
250 

Ser Leu Phe Glu 
265 

Arg Leu Lys Asn 
280 



Asn Ala Lys Tyr 



Ala Tyr Lys Val 
190 

Pro Leu Thr Asn 
205 

Ser Leu Thr Phe 
220 

Cys Ser Ser His 
235 

His Leu Asp Glu 



Thr Ser Pro Val 
270 

Ala Ser Leu 
285 



Pro Gin Gly lie 
175 

Asn Gly Val Ser 



Phe Leu Asp Pro 
210 

Lys Asn Gly Thr 
225 

Leu Gly Met Val 
240 

Val Asn Gin Val 
255 

Ser Val Tyr Arg 



Pro Thr Ser Ala 
180 

Tyr Tyr Ser Trp 
195 

Ser Asp Ala Phe 



Ala Asn Asp Gly 
230 

lie Arg Asp Asn 
245 

Phe Gly Leu Thr 
260 

Gin His Ala Asn 
275 



(2) INFORMATION FOR SEQ ID NO : 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 85. .1017 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 163 , .1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 6 0 

CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 111 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 15 9 

Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 

GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 2 07 

Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 
15 10 15 



ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGG TTC GGC ATT 
Met Leu Gly Phe Asp Asn lie Leu Gly Val Asp Tyr Trp Phe Gly lie 
20 25 30 



255 



- 45 - 



CCC AGC GCC TTG CGC CGT GAG GGT GCC GAG GTC TAG GTC ACC GAA GGC 3 03 

Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Gly 
35 40 45 

AGC GAG TTG GAG ACC TCG GAA GTC CGC GGC GAG GAG TTG CTG CAA GAG 3 51 

Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 

GTG GAG GAA ATC GTC GCC GTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 3 99 

Val Glu Glu lie Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu lie 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAG GTC GCC GCC GTA CGT 44 7 

Gly His Ser His Gly Gly Pro Thr lie Arg Tyr Val Ala Ala Val Arg 
80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AAG GGT 4 95 

Pro Asp Leu lie Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 
100 105 110 

TCG GAC ACC GCC GAC TTG CTG GGC CAG ATC CCA CCG GGT TCG GCC GGC 54 3 

Ser Asp Thr Ala Asp Phe Leu Arg Gin lie Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC GTC TCC GGG CTG GTC AAC AGC CTG GGC GCG CTG ATC AGC 5 91 

Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu lie Ser 

130 135 140 

TTC CTT TCC AGC GGC GGC ACC GGT ACG CAG AAT TTA CTG GGC TCG CTG 63 9 

Phe Leu Ser Ser Gly Gly Thr Gly Thr Gin Asn Leu Leu Gly Ser Leu 
145 150 155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAG CCG 6 87 

Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 
160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCC TAG AAG GTC AAC 73 5 

Gin Gly lie Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 185 190 

GGC GTG AGC TAT TAG TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 7 83 

Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 200 205 

CTG GAT CCG AGC GAC GCC TTC GTC GGC GCC TCG TCG CTG ACC TTC AAG 831 

Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 

210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 8 79 

Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 
225 230 235 

GGC ATG GTG ATC GGC GAC AAC TAG GGG ATG AAC CAC CTG GAC GAG GTG 92 7 

Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 
240 245 250 255 

AAC CAG GTC TTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 975 
Asn Gin Val Phe Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 265 270 



- 46 - 



GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 
Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 
275 280 285 



1017 



TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CO 



1049 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala lie Gly Leu Ala 
-26 -25 -20 -15 



Ser Leu Ala Ala Ser Pro Leu lie Gin Ala Ser Thr Tyr 
-10 -5 1 

Lys Tyr Pro lie Val Leu Ala His Gly Met Leu Gly Phe 
10 15 

Leu Gly Val Asp Tyr Trp Phe Gly lie Pro Ser Ala Leu 
25 30 35 

Gly Ala Gin Val Tyr Val Thr Glu Gly Ser Gin Leu Asp 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu lie 
55 60 65 

Ser Gly Gin Pro Lys Val Asn Leu lie Gly His Ser Hif 
75 80 

Thr lie Arg Tyr Val Ala Ala Val Arg Pro Asp Leu lie 
90 95 

Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr Ala 
105 110 115 

Arg Gin lie Pro Pro Gly Ser Ala Gly Glu Ala Val Leu 
120 125 130 

Val Asn Ser Leu Gly Ala Leu lie Ser Phe Leu Ser Ser 
135 140 145 

Gly Thr Gin Asn Leu Leu Gly Ser Leu Glu Ser Leu Asn 
155 160 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly He Pro 
170 175 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser 
200 205 210 



Thr Gin Thr 
5 

Asp Asn He 
20 

Arg Arg Asp 

Thr Ser Glu 

Val Ala Leu 
70 

Gly Gly Pro 
85 

Ala Ser Ala 
100 

Asp Phe Leu 

Ser Gly Leu 

Gly Gly Thr 
150 

Ser Glu Gly 
165 

Thr Ser Ala 
180 

Tyr Ser Trp 
Asp Ala Phe 



- 47 - 



Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220 225 230 

Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val lie Arg Asp Asn 
235 240 245 

Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Phe Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 85 . . 1017 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 163 . .1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 60 

CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 111 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 15 9 

Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 

GCC AGC ACC TAG ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 2 07 

Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 
15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGG TTC GGC ATT 2 55 

Met Leu Gly Phe Asp Asn He Leu Gly Val Asp Tyr Trp Phe Gly He 
20 25 30 

CCC AAC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GGC 3 03 

Pro Asn Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Gly 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 351 
Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 



- 48 - 



GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 3 99 

Val Glu Glu lie Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu lie 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 44 7 

Gly His Ser His Gly Gly Pro Thr lie Arg Tyr Val Ala Ala Val Arg 
80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AAG GGT 4 95 

Pro Asp Leu lie Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 
100 105 110 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 54 3 

Ser Asp Thr Ala Asp Phe Leu Arg Gin lie Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 5 91 

Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu lie Ser 
130 135 140 

TTC CTT TCC AGC GGC GGC ACC GGT ACG CAG AAT TTA CTG GGC TCG CTG 63 9 

Phe Leu Ser Ser Gly Gly Thr Gly Thr Gin Asn Leu Leu Gly Ser Leu 
145 150 155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAC CCG 687 

Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 
160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCT TAC AAG GTC AAC 73 5 

Gin Gly lie Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 185 190 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 7 83 

Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 200 205 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 831 

Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 
210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 87 9 

Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 
225 230 235 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 92 7 

Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 
240 245 250 255 

AAC CAG GTC CTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 975 

Asn Gin Val Leu Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 

Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 
275 280 - 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CCG 105 0 



- 49 - 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

M.t .v. .y. ser .eu Leu Pro Leu Gly Leu He aly I-- 

11 Z Al. Al. ser pre Leu He Oln Al. ser Thr Tyr T.r 01„ Thr 

-10 ^ 

Lye Tyr Pro II. v.l Leu Al. His oly Met Leu uly Ph. Aep As„ He 

Leu Gly V,l Z Tyr Trp Ph. Gly He Pro A,n Al. L.u Arg Arg A.p 

25 

Oly .la Gin val Tyr Val Thr Olu Oly Ser Gin .eu Asp Thr Ser Glu 
Val Z Gly Glu Gin .eu .eu Gin Gin Val Glu Glu He Val Ala .eu 

sir Gly Gin Pro .ys Val Asn .eu He Gly His Ser His Gly Gly Pro 

75 

Thr II. Arg Tyr v.l Al. Al. V.l Arg Pro A.p Leu He Al. Ser Al. 
Thr ser v.l Gly Al. Pro His Lys Gly ser Asp Thr Al. Asp Phe Leu 
„g Glu Ii! pro pro Gly ser Al. Gly Glu Al, v.l L.u ser Gly L.u 

120 ^-^^ 

^1 Tvi, T^i, Tie Ser Phe Leu Ser Ser Gly Gly Thr 
Val Asn ser Leu Gly Ala Leu lie ser t'ne 

14 0 -L*± -» 

Tiy Thr Gin Asn .eu Leu Gly Ser I.eu Glu Ser Leu Asn Ser Glu Gly 

155 

Al. Al. Ar. Ph. A.„ Al. Lys Tyr Pro Glu Gly He Pro Thr S.r Al, 

cys Gly Gl. Tiy Al. Tyr Lys val Asn Gly v.l ser Tyr Tyr ser Trp 

185 

Tr... Thr- Asn Phe Leu Asp Pro Ser Asp Ala Phe 
ser Gly Ser Ser Pro Leu Thr Asn Pne u y 

200 

L.U Gly Al. ser ser Leu Thr Phe Lys Asu Gly Thr Al. As„ A.p Gly 

IZ v.l Gly Thr cys Ser ser His Leu Gly Met v.l He Ar, Asp Asu 

235 



- 50 - 



Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Leu Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 85. ,1017 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 163 - . 1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 6 0 

CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 111 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 15 9 

Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 

GCC AGC ACC TAC ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 2 07 

Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 
15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGG TTC GGC ATT 2 55 

Met Leu Gly Phe Asp Asn lie Leu Gly Val Asp Tyr Trp Phe Gly lie 
20 25 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GGC 3 03 

Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Gly 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 3 51 

Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
5 0 5 5 6 0 



GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 
Val Glu Glu lie Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu lie 
65 70 75 



399 



- 51 - 



GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAG GTC GCC GCC GTA CGT 44 7 

Gly His Ser His Gly Gly Pro Thr lie Arg Tyr Val Ala Ala Val Arg 

80 85 90 95 

CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AGG GGT 4 95 

Pro Asp Leu lie Ala Ser Ala Thr Ser Val Gly Ala Pro His Arg Gly 

100 105 110 

TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 54 3 

Ser Asp Thr Ala Asp Phe Leu Arg Gin lie Pro Pro Gly Ser Ala Gly 

115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 5 91 

Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu lie Ser 

130 135 140 

TTC CTT TCC AGC GGC GGC ACC GGT ACG CAG AAT TTA CTG GGC TCG CTG 63 9 

Phe Leu Ser Ser Gly Gly Thr Gly Thr Gin Asn Leu Leu Gly Ser Leu 

145 150 155 

GAG TCG CTG AAC AGT GAG GGT GCC GCG CGC TTC AAC GCC AAG TAC CCG 6 87 

Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 

160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCT TAC AAG GTC AAC 735 

Gin Gly lie Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 

180 185 190 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 7 83 

Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 

195 200 205 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 831 

Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 

210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 879 

Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 

225 230 235 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 92 7 

Gly Met Val lie Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 

240 245 250 255 

AAC CAG GTC CTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 97 5 

Asn Gin Val Leu Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 

260 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 

Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 

275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 104 9 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



- 52 - 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala He Gly Leu Ala 

-26 -25 

ser Leu Ala Ala Ser Pro Leu He Gin Ala Ser Thr Tyr Thr Gin Thr 

-10 -5 

Lys Tyr Pro He Val Leu Ala His Gly Met Leu Gly Phe Asp Asn He 

10 1^ 
I.eu Gly val Asp Tyr Trp Phe Gly He Pro Ser Ala Leu Arg Arg Asp 
25 

Gly Ala Gin Val Tyr Val Thr Glu Gly Ser Gin Leu Asp Thr Ser Glu 
40 

val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu He Val Ala Leu 

55 60 
ser Gly Gin Pro Lys Val Asn Leu He Gly His Ser His Gly Gly Pro 

75 

Thr He Arg Tyr Val Ala Ala Val Arg Pro Asp Leu He Ala Ser Ala 
90 

Thr ser Val Gly Ala Pro His Arg Gly Ser Asp Thr Ala Asp Phe Leu 

105 

Arg Gin He Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 

120 

val Asn ser Leu Gly Ala Leu He Ser Phe Leu Ser Ser Gly Gly Thr 
135 

Gly Thr Gin Asn Leu Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 

155 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly He Pro Thr Ser Ala 

170 -^^^ 
Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Trp 

185 '^^^ 
ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 

200 205 210 

Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220 22b 

.eu val Gly Thr Cys Ser Ser His Leu Gly Met Val He Arg Asp Asn 

235 240 

Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Leu Gly Leu Thr 

250 255 
ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 



265 270 

Arg Leu Lys Asn Ala Ser Leu 
280 285 



- 53 - 



(2) INFORMATION FOR SEQ ID NO : 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 85. .1017 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 163 1017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GGATCCCCCG GTTCTCCCGG AAGGATTCGG GCGATGGCTG GCAGGACGCG CCCCTCGGCC 6 0 

CCATCAACCT GAGATGAGAA CAAC ATG AAG AAG AAG TCT CTG CTC CCC CTC 111 

Met Lys Lys Lys Ser Leu Leu Pro Leu 
-26 -25 -20 

GGC CTG GCC ATC GGT CTC GCC TCT CTC GCT GCC AGC CCT CTG ATC CAG 15 9 

Gly Leu Ala lie Gly Leu Ala Ser Leu Ala Ala Ser Pro Leu lie Gin 
-15 -10 -5 

GCC AGC ACC TAG ACC CAG ACC AAA TAC CCC ATC GTG CTG GCC CAC GGC 2 07 

Ala Ser Thr Tyr Thr Gin Thr Lys Tyr Pro lie Val Leu Ala His Gly 
15 10 15 

ATG CTC GGC TTC GAC AAC ATC CTT GGG GTC GAC TAC TGG TTC GGC ATT 255 
Met Leu Gly Phe Asp Asn lie Leu Gly Val Asp Tyr Trp Phe Gly lie 
20 25 30 

CCC AGC GCC TTG CGC CGT GAC GGT GCC CAG GTC TAC GTC ACC GAA GGC 3 03 

Pro Ser Ala Leu Arg Arg Asp Gly Ala Gin Val Tyr Val Thr Glu Gly 
35 40 45 

AGC CAG TTG GAC ACC TCG GAA GTC CGC GGC GAG CAG TTG CTG CAA CAG 3 51 

Ser Gin Leu Asp Thr Ser Glu Val Arg Gly Glu Gin Leu Leu Gin Gin 
50 55 60 

GTG GAG GAA ATC GTC GCC CTC AGC GGC CAG CCC AAG GTC AAC CTG ATC 3 99 

Val Glu Glu lie Val Ala Leu Ser Gly Gin Pro Lys Val Asn Leu lie 
65 70 75 

GGC CAC AGC CAC GGC GGG CCG ACC ATC CGC TAC GTC GCC GCC GTA CGT 44 7 

Gly His Ser His Gly Gly Pro Thr lie Arg Tyr Val Ala Ala Val Arg 
80 85 90 . 95 



CCC GAC CTG ATC GCT TCC GCC ACC AGC GTC GGC GCC CCG CAC AAG GGT 
Pro Asp Leu lie Ala Ser Ala Thr Ser Val Gly Ala Pro His Lys Gly 
100 105 110 



495 



- 54 - 



TCG GAC ACC GCC GAC TTC CTG CGC CAG ATC CCA CCG GGT TCG GCC GGC 54 3 

Ser Asp Thr Ala Asp Phe Leu Arg Gin He Pro Pro Gly Ser Ala Gly 
115 120 125 

GAG GCA GTC CTC TCC GGG CTG GTC AAC AGC CTC GGC GCG CTG ATC AGC 5 91 

Glu Ala Val Leu Ser Gly Leu Val Asn Ser Leu Gly Ala Leu He Ser 

130 135 140 

TTC CTT TCC AGC GGC GGC ATC GGT ACG CAG AAT TTT CTG GGC TCG CTG 63 9 

Phe Leu Ser Ser Gly Gly He Gly Thr Gin Asn Phe Leu Gly Ser Leu 

145 150 155 

GAG TCG CTG AAC AGC GAG GGT GCC GCG CGC TTC AAC GCC AAG TAC CCG 68 7 

Glu Ser Leu Asn Ser Glu Gly Ala Ala Arg Phe Asn Ala Lys Tyr Pro 

160 165 170 175 

CAG GGC ATC CCC ACC TCG GCC TGC GGC GAA GGC GCC TAC AAG GTC AAC 735 

Gin Gly He Pro Thr Ser Ala Cys Gly Glu Gly Ala Tyr Lys Val Asn 
180 185 ■ 190 

GGC GTG AGC TAT TAC TCC TGG AGC GGT TCC TCG CCG CTG ACC AAC TTC 7 83 

Gly Val Ser Tyr Tyr Ser Trp Ser Gly Ser Ser Pro Leu Thr Asn Phe 
195 200 205 

CTC GAT CCG AGC GAC GCC TTC CTC GGC GCC TCG TCG CTG ACC TTC AAG 831 

Leu Asp Pro Ser Asp Ala Phe Leu Gly Ala Ser Ser Leu Thr Phe Lys 

210 215 220 

AAC GGC ACC GCC AAC GAC GGC CTG GTC GGC ACC TGC AGT TCG CAC CTG 8 79 

Asn Gly Thr Ala Asn Asp Gly Leu Val Gly Thr Cys Ser Ser His Leu 

225 230 235 

GGC ATG GTG ATC CGC GAC AAC TAC CGG ATG AAC CAC CTG GAC GAG GTG 927 

Gly Met Val He Arg Asp Asn Tyr Arg Met Asn His Leu Asp Glu Val 

240 245 250 255 

AAC CAG GTC TTC GGC CTC ACC AGC CTG TTC GAG ACC AGC CCG GTC AGC 975 

Asn Gin Val Phe Gly Leu Thr Ser Leu Phe Glu Thr Ser Pro Val Ser 
260 265 270 

GTC TAC CGC CAG CAC GCC AAC CGC CTG AAG AAC GCC AGC CTG 1017 

Val Tyr Arg Gin His Ala Asn Arg Leu Lys Asn Ala Ser Leu 
275 280 285 

TAGGACCCCG GCCGGGGCCT CGGCCCGGGC CC 104 9 



(2) INFORMATION FOR SEQ ID NO : 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Lys Lys Lys Ser Leu Leu Pro Leu Gly Leu Ala He Gly Leu Ala 
-26 -25 -20 -15 



Ser Leu Ala Ala Ser Pro Leu He Gin Ala Ser Thr Tyr Thr Gin Thr 
-10 -5 15 
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Lys Tyr Pro lie Val Leu Ala His Gly Met Leu Gly Phe Asp Asn lie 
10 15 20 

Leu Gly Val Asp Tyr Trp Phe Gly lie Pro Ser Ala Leu Arg Arg Asp 
25 30 35 

Gly Ala Gin Val Tyr Val Thr Glu Gly Ser Gin Leu Asp Thr Ser Glu 
40 45 50 

Val Arg Gly Glu Gin Leu Leu Gin Gin Val Glu Glu He Val Ala Leu 
55 60 65 70 

Ser Gly Gin Pro Lys Val Asn Leu He Gly His Ser His Gly Gly Pro 
75 80 85 

Thr He Arg Tyr Val Ala Ala Val Arg Pro Asp Leu He Ala Ser Ala 
90 95 100 

Thr Ser Val Gly Ala Pro His Lys Gly Ser Asp Thr Ala Asp Phe Leu 
105 110 115 

Arg Gin He Pro Pro Gly Ser Ala Gly Glu Ala Val Leu Ser Gly Leu 
120 125 130 

Val Asn Ser Leu Gly Ala Leu He Ser Phe Leu Ser Ser Gly Gly He 
135 140 145 150 

Gly Thr Gin Asn Phe Leu Gly Ser Leu Glu Ser Leu Asn Ser Glu Gly 
155 160 165 

Ala Ala Arg Phe Asn Ala Lys Tyr Pro Gin Gly He Pro Thr Ser Ala 
170 175 180 

Cys Gly Glu Gly Ala Tyr Lys Val Asn Gly Val Ser Tyr Tyr Ser Trp 
185 190 195 

Ser Gly Ser Ser Pro Leu Thr Asn Phe Leu Asp Pro Ser Asp Ala Phe 
200 205 210 

Leu Gly Ala Ser Ser Leu Thr Phe Lys Asn Gly Thr Ala Asn Asp Gly 
215 220 225 230 

Leu Val Gly Thr Cys Ser Ser His Leu Gly Met Val He Arg Asp Asn 
235 240 245 

Tyr Arg Met Asn His Leu Asp Glu Val Asn Gin Val Phe Gly Leu Thr 
250 255 260 

Ser Leu Phe Glu Thr Ser Pro Val Ser Val Tyr Arg Gin His Ala Asn 
265 270 275 

Arg Leu Lys Asn Ala Ser Leu 
280 285 



(2) INFORMATION FOR SEQ ID NO : 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : unknown 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GCGCAATTAA CCCTCACTAA AGGGAACAAA 



(2) INFORMATION FOR SEQ ID NO : 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GGTACGCAGA ATNNNCTGGG CTCGC 



(2) INFORMATION FOR SEQ ID NO : 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GCGTAATACG ACTCACTATA GGGCGAA 



